In-car speech recognition using distributed microphones-adapting to automatically detected driving conditions

نویسندگان

  • Hideki Banno
  • Tetsuya Shinde
  • Kazuya Takeda
  • Fumitada Itakura
چکیده

In this paper, we describe a multichannel method of noisy speech recognition that can adapt to various in-car noise situations during driving. The method allows us to estimate the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by multiple distributed microphones. Through clustering of the spatial noise distributions under various driving conditions, the regression weights for MRLS are effectively adapted to the driving conditions. The experimental evaluation shows an average error rate reduction of 43 % in isolated word recognition under 15 different driving conditions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimizing regression for in-car speech recognition using multiple distributed microphones

In this paper, we address issues in improving handsfree speech recognition performance in different car environments using multiple spatially distributed microphones. In previous work, we proposed multiple regression of the log-spectra (MRLS) for estimating the logspectra of speech at a close-talking microphone. In this paper, the idea is extended to nonlinear regressions. Isolated word recogni...

متن کامل

CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition

This paper introduces a common database and an evaluation framework for connected digit speech recognition in real driving car environments, CENSREC-2, as an outcome of IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. Speech data of CENSREC-2 was collected using two microphones, a close-talking microphone and a hands-free microphone, under three car speeds and four car conditions...

متن کامل

Multiple Regression of Log-spectra Fo

This paper describes a new multichannel method of noisy speech recognition, which estimates the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by the distributed microphones. Since the method does not assume the arrangement of sound sources and microphones, it can be applied to in-car speech recognition d...

متن کامل

CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments

This paper introduces a common database, an evaluation framework, and its baseline recognition results for in-car speech recognition, CENSREC-3, as an outcome of the IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. CENSREC-3, which is a sequel to AURORA-2J, has been designed as the evaluation framework of isolated word recognition in real car-driving environments. Speech data wer...

متن کامل

Detection of Local Disturbances and Simultaneously Active Speakers for Distributed Speaker-Dedicated Microphones in Cars

For automotive hands-free and speech recognition applications, distributed microphones are often mounted in the car where each of the speakers has a dedicated microphone close to his position. To provide additional control information for further speech enhancement, it is often advantageous to distinguish between the activity of the different passengers. In this contribution speaker activity is...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003